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Abstract 

A complex contagion is an infectious process in which individuals may require multiple transmissions 
before changing state. These are used to model behaviors if an individual only adopts a particular 
behavior after perceiving a consensus among others. We may think of individuals as beginning inactive 
and becoming active once they are contacted by a sufficient number of active partners. These have 
been studied in a number of cases, but analytic models for the dynamic spread of complex contagions 
are typically complex. Here we study the dynamics of the Watts Threshold Model (WTM) assuming 
that transmission occurs in continuous time as a Poisson process, or in discrete time where individuals 
transmit to all partners in the time step following their infection. We adapt techniques developed for 
infectious disease modeling to develop an analyze analytic models for the dynamics of the WTM in 
Configuration Model networks and a class of random clustered (triangle-based) networks. The resulting 
model is relatively simple and compact. We use it to gain insights into the dynamics of the contagion. 
Taking the infinite population limit, we derive conditions under which cascades happen with an arbitrarily 
small initial proportion active, confirming a hypothesis of Watts for this case. We also observe hybrid 
phase transitions when cascades are not possible for small initial conditions, but occur for large enough 
initial conditions. We derive sufficient conditions for this hybrid phase transition to occur. We show 
that in many cases, if the hybrid phase transition occurs, then all individuals eventually become active. 
Finally, we discuss the role clustering plays in facilitating or impeding the spread and find that the 
hypothesis of Watts that was confirmed in Configuration Model networks does not hold in general. This 
approach allows us to unify many existing disparate observations and derive new results. 

key words: Watts Threshold Model, Cascades, Hybrid Bifurcation, Complex Contagion, Dynamic 
model 


1 Introduction 

Many “infectious” processes spread on social contact networks. The most studied of these are susceptible- 
infected-susceptible (SIS) and susceptible-infected-recovered (SIR) diseases. For these processes, a single 
transmission will cause infection if the recipient is susceptible. However, a range of behavioral patterns such 
as rumor spread or technology use are believed to spread as “complex contagions”. Here, a node in the 
network typically requires more than one transmission from “active” nodes to become “activated” [42] . 

These have received less attention compared to infectious diseases. However, their importance for under¬ 
standing large-scale social behaviors is significant. In understanding processes for which individuals adopt a 
behavior if they sense that there is a consensus in favor of that behavior, a simple SIS- or SIR-likc disease 
model will be unable to capture the dynamics. Experimental evidence [J provides support for the hypothesis 
that people adopt a behavior when they perceive such a consensus, and that this has an impact on the global 
dynamics. Such behaviors are important for understanding how bubbles form in stock markets or how new 
technologies are adopted. 

We will study the Watts Threshold Model, model introduced by [42] . In this model, each node u is 
assigned a threshold r u such that if it has at least r u active neighbors it becomes active. We allow the 
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threshold to be a random variable depending on the degree of node u. If r u < 0, then u is active at the 
initial time. In frequently studied cases, r u is taken to be same for all nodes or r u = ak u represents the 
same proportion of neighbors. By careful choice of the rule for assigning r u , we can arrive at a range of 
other percolation processes as special cases of the WTM [50], including bond, site, k- core, and bootstrap 
percolation. 

We will focus most of our attention on a continuous time version of the WTM, using an asynchronous 
update rule. A node u receives transmission independently from each of its active neighbors at rate /3. We 
also consider a discrete time version in which active nodes transmit to their neighbors with probability 1 in the 
time step after becoming active. In either case, the transmission could be the transfer of an actual infectious 
agent, or it might simply be the observation that v has adopted some new behavior. This is sometimes 
referred to as a “message” which is passed m na, but in this study we will use the term “transmit”. Once 
node u has received r u transmissions, it immediately switches to active and then begins to transmits to its 
neighbors. 

To study this, we will adapt a compartmental modeling approach focusing on the status of edges rather 
than nodes. This “Edge-based compartmental modeling” technique was originally developed for SIR disease 
spread and can be applied with a wide range of assumptions about the population or disease process [2811321 
[53] . It allows for a significant reduction in the number of equations used compared to other approaches [22 , 
[2;,:9j [38], in some cases reducing an infinite (or even doubly infinite) system of differential equations to an 
equivalent system governed by a single differential equation mm- We will see that the same approach can 
give a significant simplification here as well. Because the WTM contains many other percolation processes 
as special cases [3D], this approach can be adapted to those as well. 

The primary network class we study is the Configuration Model networks, a class of random networks 
which are determined by a degree distribution. These networks have a negligible amount of short cycles: 
the existence of an edge from u to v and u to w provides no information about the length of alternate paths 
between v and w. Consequently, as long as u remains inactive, knowledge about the status of v provides no 
information about the status of w. 

It is generally believed that clustering enhances the spread of complex contagions mum and ex¬ 
perimental evidence supports this [4j. The basis for this argument is that clustering will tend to introduce 
correlations between the statuses of a node’s neighbors. Thus a node who sees one active neighbor is more 
likely to see others, and thus more likely to activate. In contrast it is fairly well established that clustering 
tends to reduce the spread of SIR diseases, even if only mildly [2D, ,TD[ [27J [2TJ HU [27, '23]. Work by [13] 
suggests that clustering may increase or decrease the final size of WTM cascades. We will use our model 
to study how clustering alters the dynamics of WTM cascades by studying a specialized model of clustered 
networks independently introduced by [26| [5D]. 

In our study we find hybrid phase transitions as the initial active proportion increases. That is, the final 
size has a square root scaling on one side of the bifurcation point with a large jump in size at the transition. 
These appear to be common in the WTM. We discuss some of the examples we encounter. Although these 
do not appear to have been seen in the WTM when initial proportion is the bifurcation parameter, they have 
been seen in several types of percolation that are a special case of the WTM 00181 and a similar bifurcation 
exists for the WTM with average degree as the bifurcation parameter, so this result is not surprising of its 
own right. We are able to find some sufficient conditions for the hybrid phase transition to occur. In 
particular, there is a hybrid phase transition if r < k — 1 for all nodes and neither the r = 1 nodes nor 
the r = k — 1 nodes form a giant component. Furthermore in this scenario we anticipate that above this 
threshold all nodes in the giant component eventually activate, but we do not have a rigorous proof. 

Related analytic work has primarily focused on the final size of cascades [nuii Elisa Bn- There has 
been limited work investigating dynamic spread. In particular section III of mi (see also [24] ) uses equations 
that implicitly assume that once a node reaches its threshold of active neighbors there is a delay before it 
becomes active as and once active it immediately transmits to all of its neighbors. Another model of [3D) 
allows a transmission rate /3 and focused on the case of a fixed value of r with a final transmission probability 
less than 1. This paper derived equations similar to those we will derive for the continuous time spread in 
configuration model networks, but did not analyze them in detail. 
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In this paper, we demonstrate a mathematical approach that leads to a straightforward derivation of the 
governing equations. We then analyze the resulting equations, introducing some new results on the behavior 
of cascades and unifying a range of disparate results about the WTM in random networks. 

2 The test node 

Our derivation of the governing equations follows the “edge-based compartmental modeling” approach of [32, 
133} l34j . This approach is based on the observation that if the population-scale dynamics of an epidemic are 
deterministic, then the probability a random node has a given status is equal to the proportion of the 
population that has that status. We perform a subtle shift of focus: rather than trying to calculate the 
proportion of the population that is in each status, we find it to be easier to calculate the probability a 
random node has each status. 

We will consider a randomly chosen node and modify it such that it cannot transmit infection. When 
first seen, this modification is often confusing, and we explain it in more detail below. It is equivalent 
to the “cavity method” used in the message-passing approaches m- The resulting model is based on 
probability generating functions, and so it might be more appropriately called a “probability generating 
function” method. However we avoid this term because depending on the details of the situation modeled, 
the resulting equations may not be probability generating functions. Indeed in our case, we will see functions 
that are similar to probability generating functions, with subtle differences. 

The basic derivation can be summarized in four steps: 

• We begin with the hypothesis that the population-scale dynamics are deterministic, but node-scale 
events are stochastic. 

• If the dynamics are deterministic, then the specific timing of when (or even whether) a random node u 
is activated has negligible impact on the population-scale dynamics. So removing u from the population 
would have negligible impact on the dynamics. 

• Rather than removing u we simply prevent it from transmitting to its neighbors, but allow it to 
otherwise be as before. The resulting size is larger than would be seen if u were removed, but smaller 
than if u were not modified. Since the difference between these is negligible, the resulting dynamics 
are indistinguishable from the original. 

• Preventing u from transmitting has no impact on zt’s status. Since u is a randomly chosen node, the 
probability it would have a given status at a given time must match the proportion of the population 
that would have that status at that time (because the dynamics are deterministic). Thus calculating 
the probability the modified u has a given status is equivalent to calculating the proportion of the 
population with that given status if u were not modified. 

We note that a careful analysis shows that the second and third points are actually unneeded. While 
they are true, the key feature is that if the dynamics are deterministic then the probability an unaltered 
randomly chosen node u has a given status equals the proportion of the population with that status, and the 
probability the unaltered node u has a given status equals the probability that it has that status even after 
alteration. Thus calculating the probability the altered node u has a given status tells us the proportion of 
the population with each status in the original population. 

We call the altered randomly chosen node u a test node. In the configuration model case, the fact that 
u cannot transmit to its partners means that their statuses are independent of one another. We will find a 
consistency equation which gives us the probability that a random neighbor of u is still inactive, which in 
turn allows us to determine the probability u is active or inactive. 
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Variable 

Description 

P(k, r) 

The probability that a random node has degree k and threshold r. 

P 

Transmission rate from active node to partner. 

Q(t ) 

The probability a test node u is still quiescent (inactive). 

A{t) 

The probability a test node u is active. 

<t>a(t) 

The probability a random neighbor of a test node u is still quiescent. 

<t>A{t) 

The probability a random neighbor of a test node u is active but has not yet 
transmitted to u. 

0(t) = <t> Q {t ) + 4> A {t) 

The probability a random neighbor of the test node u has not transmitted 
to u. 

1 - 9(t) 

The probability a random neighbor of the test node u has transmitted to u. 

ts S 0 [/(*)]!*=«, 

o f^\ x o)( x — x oy/j- the Taylor Series of f(x) centered at x 0 , evaluated 
at x± and truncated at the nth term. 

4>r,CM{x) 

P(k, r ) xk a function related to probability generating functions which 
will be used to determine the proportion of the population that is both 
quiescent and has threshold r. 

(K)=E k , r kP(k,r) 

The average degree of nodes in the population 


Table 1: The variables used in the continuous time model for transmission in a configuration model network 

3 The WTM in Configuration Model networks 

A Configuration Model network is generated by assigning each node a degree k independently of any other 
degrees assigned in the population. Each node is then given k “stubs” and stubs are randomly paired until 
each stub is in exactly one pair which forms an edge. We use P{k,r) to be the probability a random node 
has degree k and threshold r. A neighbor of a random node has degree k and threshold r with probability 
kP{k,r)/ ( K) where ( K) is the average degree. This size bias represents the fact that the probability v is a 
neighbor of a random node is proportional to the number of neighbors v has. 

A consequence of the formation process is that a random neighbor of a node has a given degree k with 
probability P n {k,r) = kP(k,r)/ (K) where ( K) represents the average degree. We note that a neighbor’s 
random neighbor has a given k and r also with probability P(k,r). We will determine the probability a 
random neighbor v of the test node u has not yet transmitted to u in terms of the probability that v’s 
random neighbors have not transmitted to u. Since the neighbors of v (other than u) are chosen from the 
same distribution as v, this will be the step that leads us to a consistency equation for the probability v is 
still inactive in terms of the probability that neighbors of v are still inactive. 

3.1 Continuous time 

We begin by studying a continuous time model. In this model an active neighbor v of u (with threshold 
r u ) transmits to u at rate /? (as a Poisson process). Once the r u th neighbor of u has transmitted to u, u 
immediately activates. Once a node (other than the test node) activates it begins transmitting immediately. 
Note that we do not count repeated transmissions from v to u. 

We let u be the randomly chosen test node and v be a random neighbor of u. If w is another neighbor 
of u. the status of v is independent of the status of w because u is prevented from transmitting and thus 
information about one cannot pass to the other. 

We define 9{t) to be the probability that v has not yet transmitted to u. We break 6(t) into two parts: 
4>Q{t), the probability v has not yet transmitted and is quiescent, and 4>A(t), the probability v has not yet 
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Figure 1: (left) A flow diagram showing the transitions between the possible states for v: From being 
quiescent (with probability 4>q) to being active but having not yet transmitted to u (with probability (/)a ) 
to having transmitted (with probability 1 — 6). (right) A flow diagram showing the transitions for possible 
states for it: from quiescent to active. 


transmitted to u but is active. Then 8(t) = 4>Q{t) + </qi(i) and 1 — 9(t) is the probability v has transmitted 
to it. We demonstrate the flow of probability between these compartments in figure [Q 

The probability u is still quiescent is the probability that fewer than r neighbors have transmitted to u. 
Taking m to be the number of transmissions u has received, and summing over all possible values of k 7 r, 
and to < r we get 


^) = EEE p ( fc - 

k r >0 m —0 
,— 1 /t (m) 

= EE 

r>0 m=0 


WcUm 


ml 


8(t) k ~ m (l — 6) r 

(i-mr 


where tl> r ,CM(x) = Y^k r)x k . Later when we consider networks with triangles, there will be a related 
function tp r ,A- Note that the inner sum gives the first terms of the Taylor Series approximation for Vv,Cm(1) 
expanded at 9(t). We take advantage of this to reduce the summation to 


Q(t ) = E TS e ( t)^ ii > CfM ( a: )]l a!= i 

r>0 

where TS”[/(a;)] = ~ x o) J f U) (xo)/j'- is the Taylor Series of / centered at Xo and truncated at the 

/(")th term. 

If we have 9(t ), this gives us Q(t ), from which we can immediately find the probability u is active: 
A(t) = 1 — Q(t). Thus we need an expression for 9(t), the probability a random neighbor v has not 
transmitted to u. The probability v has degree k and threshold r is given by kP(k 7 r)/ (K). Given the 
degree k and an m < r, the probability v has received to transmissions is ( -1 )0(t) fe-1-m ( 1 — 9{t)) m . The 
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k — 1 occurs because we know that u will not transmit to v. Thus the probability v is quiescent is 


r —1 


^) = EEE 


kP(k, r ) fk—1 


k r>0 m=0 

l 

Jk) 


(K) \ m 


*(*) 


fc—1 —m 


a -mr 




r >0 ra =0 


74E TS ^)Kcm(^)]|x=i, 


r>0 

with </>a(£) = 0(i) — <pQ(t). To find 6(t), we simply observe that 

9 = ~/34>a ■ 

Substituting for 4>a in terms of 9, our full system of equations is 


0 = -P ^E TS mK cm(*)]U=i 

Q=E TS sr*)^cM(*)]i*= 1 

r>0 

A = l-Q. 

These equations are very similar to those of [40] who studied the case of a fixed value of r with a final 
transmission probability less than 1. 

The single ODE for 9 governs the dynamics. As initial conditions, we assume that a fraction p of the 
population is initially active. This appears in the equations as setting P(k, 0) to be nonzero for some k 
and reducing the value of P(k,r) for other r values. If the fraction is chosen uniformly, then each 'i/y.CM 
function will contain a factor 1 — p, that is Vv,Cm(#|p) = (1 — p)'i/v,CM(0|O) where the notation Vv,cm(0|p) 
denotes the value of Vv,Cm( 0) given p. These equations predict similar dynamics to those of section III 
of mi (see also [3]), but those equations implicitly assume that once a node reaches its threshold of active 
neighbors it becomes active as a Poisson process and once active immediately transmits to all of its neighbors. 
In contrast, here nodes immediately become active and then transmit independently to its neighbors as a 
Poisson process. In terms of when a node v transmits to a given neighbor u, it does not matter whether the 
delay occurs after v reaches its threshold but before it becomes active or after v becomes active but before 
it transmits. However, this introduces correlations in that all neighbors of a given node receive transmission 
at the same time. In populations in which there are short cycles, these correlations can cause correlations 
in the times that a single node receives transmissions. Thus these two models have different dynamics in 
clustered networks. 

Note that as t —> oo, the value of 9 must approach a constant, so 4>a — > 0. This leads to 0(oo) = ^q(oo) 
from which we can find a formula for the final size. 

A(oo) = 1 - E TS e("«)[^ r ’ CM ^)]l a:=1 ’ 

r>0 


where 

0 M = 7^rE TS er^)[^.cM(^)]U=i • 

' ' r>0 

This is equivalent to an expression found by m- The simplest way to solve this is through iteration taking 
an initial guess for 9 of 9 = 1. We see below that this iteration corresponds to the discrete-time version of 
the model. 
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Figure 2: The WTM in a network having nodes equally distributed between degrees 2, 4, and 6. A random 
fraction p is initially active, with p ranging from 3/80 to 7/80 (left) and 3/40 to 7/40 (right). The other nodes 
either have fixed r = 2 (left) or fixed proportion r = k/2 (right). Solid curves are (Gillespie) simulations in 
a population of 400000, dashed curves are theoretical predictions. As p increases, there is a sudden jump in 
the final size. The jump is abrupt, with no initial conditions leading to an intermediate final size. Close to 
this critical value, the dynamics are sensitive to stochastic effects, but elsewhere the predictions match well. 


Figure [2] confirms that the model is accurate for spread in Configuration Model networks. We are able 
to see a hybrid phase transition, which we discuss below. If the initial active proportion is too small, the 
transmissions that occur are not enough to create significant new activations, so the numbers level off and 
spread halts. At larger initial conditions, the numbers begin to level off, but a large proportion of the 
population are very close to reaching their activation threshold. The large pool of almost active nodes 
initiates a new wave of activations and a cascade begins. A similar behavior was observed in [25] where it 
was attributed to having multiple types of active nodes, but we see here that it is possible with just one. It 
is known that at the hybrid phase transition threshold, correlation lengths diverge (in k -core percolation for 
N —> oo) Em This long correlation magnifies stochastic effects, so close to this threshold the simulations 
diverge somewhat from the predictions. 

3.2 Discrete time 

We now consider a discrete time model, with synchronous updating. Time progresses in integer units. At 
each time step, if a quiescent node of threshold r has at least r active neighbors, it transitions to active for 
the following time step. We use t = 0,1,... to denote the discrete time and we find that the expressions for 
Q(t), <pQ(t), and in terms of 9(t) remain the same as in the continuous time model. However, the rule 
for how 9 updates changes 


0(t) =4> Q {t- 1) 

= TjC) 5Z T ®9(t-l)E^ , r,CM( a; )]U=l , (1) 

for t = 1,2,... and 9(0) = 1. We will be interested in how the results change as p changes. To simplify our 
notation, we define f(x) to be the right hand side of this 

/(*) = 7TT^ TS ^ _1 [^,Cm(^)]U=i • 
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Figure 3: Cobweb diagrams for networks with r = 2 for all nodes, the degrees equally distributed between 
2, 4, and 6, and a randomly chosen initially active fraction p = 0.02 (top left), 0.055 (top right), and 0.2 
(bottom left) initially active. Insets show detail for the two small values of p. This corresponds to the left 
hand plot of figure O In each case the initial 9 is 0(0) = 1, and each successive value 9{t) follows from 
equation As p increases, there is a saddle-node bifurcation resulting in a discontinuous shift in 9(oo). A 
similar bifurcation causes the jump in final sizes seen in figure (|2|l. The final plot shows the values of 9{ oo) 
as a function of p. Note the discontinuous jump with a square root scaling close to the threshold. 


This has a dependence on p through '(pr, cm- When we want to make the dependence on p explicit, we write 
f(x\p). Assuming that the initial active nodes are chosen randomly, we have f{x\p) = (1 — p)f(x\0). 

Note that the final size derived in the continuous dynamics case solve 9 = f(9). The final value 0(oo) 
will be the same for this model and the continuous model. This is because in both models any node that 
becomes active will eventually transmit to all of its neighbors. The timing of those transmissions is irrelevant 
to the cumulative effect on the recipient. The final state is the same. 

The dynamics resulting from equation ([1]) can be understood through the cobweb diagrams in figure [3] 
Here we plot f{9) and the diagonal line 9{t + 1) = 9(t). The cobweb diagram allows us to graphically iterate 
/ to find successive values of 9. 

This cobweb diagram gives insight into the phase transition. We can use geometric arguments to show that 
as p approaches the critical value p c from below, the resulting value of 9( oo) scales like 9(oo) — 9 c ~ {pc — p) 1 ^ 2 . 
We can rotate and flip the coordinates such that the diagonal 9{t + 1) = 9(t) is the horizontal axis and then 
f(9) becomes locally a parabola that crosses the (new) horizontal axis very close to its local minimum. By 
appropriately scaling and choice of 0, we can represent this parabola as x 2 — c. The parabola crosses at 
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x = ±y/c. Changes in p c — p correspond to proportional changes in c, with c = 0 at the critical value of p. 
The scaling of x as \/c corresponds to the scaling of 0(oo) — 9 C . So generically when this sort of bifurcation 
happens, the location of the fixed point has the square root scaling we have identified. 

We now look at this with a more analytical approach to achieve the same result. We use the relation 
f{9\p) = (1 — p)f(0 10). At p = p c , we have 6 C = (1 — p c )f{9 c |0) where (1 — p c )f'(9 c \0) = 1. If we decrease p 
slightly to p c + p (with p < 0), we have a new limiting 9(po) = 9 C + 9 where 

9 C + 9 = (1 — p c — p)f(9 c + 0|O). 

We expand / as a Taylor Series. The obvious truncation of / as f(9 c |0) + 9f(9 c |0) fails because (1 — 
Pc)f'{9 c |0) = 1, so the linear term (1 — p c )f'{9 c )9 is simply 9. The right hand side becomes 

(1 -Pc- p)f(9c + <9|0) = (1 - Pc)f(9c |0) - pf(9c |0) 

+ (1 — Pc)f(9 c \0)0 4- 

= 9 C + 9 — pf(9 c |0) + • • • . 

If we do not account for the neglected terms we are left 0 = —pf(9 c 10), which is wrong. We must expand / to 
second order to give 0 = — pf(9 c \0)+(l—p c )f"(9 c \0)9 1 2 /2. We conclude that 9 ~ y / 2/5/(0 c |O)/[(l — p c )f"(9 c |0)] 
as p —> 0“. This square root dependence of 9 on p defines a hybrid phase transition mm- 

3.2.1 Alternative interpretation of discrete time model 

We can alternately interpret the discrete model in terms of transmission chains of given lengths. We choose 
a node u and a random neighbor v. We prevent u from transmitting to v and v from transmitting to its 
neighbors other than u. We allow that the possibility a random neighbor of v transmits to v might be 
different from the probability v transmits to u (ultimately we will see that the values are the same, but 
for now we allow this possibility). If 9 U is the probability v does not transmit to u then in fact it is the 
probability v remains quiescent (otherwise it would eventually transmit). If 9 V is the probability a random 
other neighbor of v does not transmit to v, then we get 

0u = • (2) 

' ' r >0 

We see that if r v = k v , even if all neighbors other than u transmit to v, that will not be enough to push v 
over its threshold, so v will not transmit to u. Note that this implies that if every node has a threshold at 
least 1 less than its degree, then in the cobweb diagrams f(9) goes through 0 at 9 = 0. 

We can derive an equivalent expression to equation ([2]) for 9 V in terms of the neighbors of v , and for 
those neighbors in terms of their neighbors. Continuing this, we can define 9(t) to be the probability that a 
random neighbor of u will eventually transmit to u if we discard all nodes of distance greater than t from u. 
Then 9{t) follows the same recurrence seen before!)] 

' ' r >0 


reproducing equation ©.. 

1 This interpretation explains a potentially puzzling feature of cobweb diagrams in figure i3l namely that, for example, if we 
take 6(t — 1) = 0, we find 0{t) > 0. Previously the interpretation of this statement would be that if the probability a neighbor 

has transmitted prior to time t — 1 was 1, then the probability it has transmitted prior to the later time t is less than 1, which 
of course is nonsensical since the probability an event has not happened can only decrease in time. This does not present a 
problem for the previous analysis because the relevant initial value is 0(0) = 1, so the system never reaches this scenario. 
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3.3 Threshold condition 

In 22 it is suggested that a global cascade can only occur when the r = 1 nodes form a giant component: 

In the context of this model, we conjecture that the required condition for a global cascade is 
that the subnetwork of vulnerable vertices must percolate throughout the network as a whole, 
which is to say that the largest, connected vulnerable cluster must occupy a finite fraction of an 
infinite network.’ 

This conjecture was made assuming the p —> 0 limit. We will show that for Configuration Model networks 
this conjecture holds for the p —> 0 limit, but that hybrid phase transitions can occur if p is large enough. 
Later we will see that in the presence of clustering this conjecture is not true. 

To see the cascade condition for p — > 0, consider the cobweb diagram of figure[3] In the limit p — > 0, we find 
/ = 1 at 0 = 1. A global cascade is possible in this limit only if /'(l) > 1. Note that TS^” 1 ^ CM (cc)]| a , = i = 
ip' r cmW + ^((l ~ X Y) because it is the first r — 1 terms of the Taylor Series for ?// CM (1) centered at x. 
It follows that only the r = 1 term in equation m gives a nonzero contribution to the slope of f(9) at 
0 = 1 for p — > 0. The leading order term of /'(1) in the p — > 0 limit is YicmY)/ {&)■ So only when 
V’i cmW/ (-^0 > 1 can a global cascade begin from an arbitrarily small randomly chosen initial fraction p. 
Because ip” CM (1) = T(fc, !■)&(& — 1), we see that this is a statement about the degree and frequency of 
r = 1 nodes. In fact it says that if a global cascade can begin from an arbitrarily small randomly chosen 
initial condition then the r = 1 nodes form a giant connected component of the population. 

We can also derive this threshold through a physical interpretation. If the initial p is small, then the 
condition for a cascade is the same as the condition that the “infection” travels infinitely far from an initial 
source. Each active node will transmit to all of its neighbors, but until the cascade grows, we may assume 
that the network is tree-like. So the transmission leads to activation iff the neighbor has r = 1. Thus the 
disease can be thought of as first exploring the largest component made up of r = 1 nodes around the initial 
node. Once it has reached all of these nodes, it can only infect r = 2 or higher nodes if they have multiple 
neighbors in the initial r = 1 component. However, in a Configuration Model network, the probability the 
absence of short cycles implies that there are no such nodes unless the r = 1 component percolates the 
network. This generalizes theorem 5 of pQ where the same result was proven under the assumption that the 
threshold r u is a function of the degree k u of u (although the proof here has been less rigorous). 

3.3.1 Sufficient condition for the hybrid phase transition 

We now consider the alternate question of under what condition can there be a hybrid phase transition? In 
general this appears to be a difficult problem. However, if we assume that every node’s threshold r u satisfies 
r u < k u — 1, we can make some progress. In this case, for p = 0, there is a fixed point at 0 and at 1. The 
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If P(k, k — 1 )k(k — 1) < ( K }, then the slope at 0 = 0 is less than 1. That is, if the nodes with r = k — 1 do 
not form a giant component the slope at 9 = 0 is less than 1. At small 9 the curve lies below the diagonal!! 
If additionally there is no cascade for arbitrarily small p, then the derivative of the curve is less than 1 at 
9 = 1. So if r u < k u — 1 for every node u and neither the r = 1 nodes nor the r = k — 1 nodes form a 
giant component, then for 9 close to 0, the curve is below the diagonal, while for 9 close to 1 it is above the 
diagonal!^. This forces the existence of at least one additional fixed point strictly between 0 and 1. Take the 
largest of these. As p increases, this fixed point moves right, while the fixed point at 1 moves left. Eventually 
these two meet and annihilate in a saddle-node bifurcation, resulting in a hybrid phase transition. We see 
this in figure H] for which k = 4 and r = 2 for all nodes. 

In fact, if both of these conditions hold, then it is likely that after the hybrid phase transition, all nodes 
become active. The fixed point at 0 remains at 0 for all p. Unless there are additional fixed points below 9 C 
when the bifurcation occurs, the system will immediately move to the only remaining fixed point, 9 = 0. In 
this case, all edges eventually transmit, and all nodes eventually become active. 

More generally, as we see in figure[3]we can have a cascade even when r u > k u for some nodes. If there is 
no cascade from arbitrarily small p and there is any value of p for which f(9\p) goes below and then above 
the diagonal, then at a larger value of p this bifurcation will occur. This is what we see in figure [3] In this 
case, the cascade fails to reach the entire population. 

We can generalize our sufficient condition somewhat. If we remove the r > k — 1 nodes from the network, 
we are left with a new configuration model network. Some nodes may now have a threshold r greater than 
or equal to their new k — 1. If we repeat this process until no more nodes are removed, and the resulting 
core network has a giant component, then we will satisfy the condition for a hybrid phase transition. 


3.3.2 Necessary conditions for hybrid phase transition 

We now show that the hybrid phase transition requires that r > 1 for at least some nodes. Physically this 
makes sense because if r = 1 for all nodes, then any node in a connected component containing an initially 
active node will eventually activate. We then find another necessary condition on / which is harder to 
translate back into r. 

2 Note that in figure [3] the slope of f(0) at 9 = 0 is 0 because P(fc, k — 1) = 0. The value of / at 9 = 0 is nonzero because 
r u is not less than k u for all nodes 

3 Technically we require that the concentration ofr = lorr = fc — 1 nodes must be strictly away from the threshold to form 
a giant component such that the inequalities are strict. Otherwise k = 2, r = 1 for all nodes provides a counterexample to this 
result. 
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Figure 4: Demonstrating the bifurcation when r < k — 1 and neither the nodes with r = k — 1 nor r = 1 
form a giant component. Here k = 4 and r = 2 for all nodes. When p = 0, there are fixed points (places 
where f{6) intersects the diagonal) at 0, 1, and an intermediate value. As p increases, this intermediate 
value moves right, while the fixed point at 1 moves left. Eventually the two meet and annihilate one another 
in a saddle-node bifurcation. After the bifurcation, the only remaining fixed point is at 0. 
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When the saddle-node bifurcation occurs, The derivative of f(8\p) must decrease through 1 at the bifur¬ 
cation point. This means that it must have a negative second derivative. Since f{8\p) = (1 — p)f(6 10), this 
means that /(0|O) must have negative second derivative somewhere. Taking the second derivative of f{8 10) 
gives 


r— 1 


E E i ( ^Sm 3) m - or - mvS 2) w( 1 - er - 1 


r>0 m =0 


+ m(m - 1)^m 3) ( 6 ')( 1 - 0) 


Because ij) is defined as a summation with positive coefficients, each derivative of ij) is non-negative. In order 
for the expression above to be negative, the middle term, mip!^^\8)( 1 — 0) m_1 must be nonzero for at 
least one value of m. If r < 1 for all nodes, then when r < 1, the sum is empty and when r = 1 the only 
term in the sum over m has m = 0, so the term is zero. So a hybrid phase transition in a configuration 
model network requires r > 1 for some nodes. With minor modifications, this shows that site percolation 
(on configuration model networks) cannot exhibit this phase transition. 

Note that we can also show that if having a cascade requires p above some positive threshold, then 
f"(8 10) must change sign between 0 and 1. This is because just before the bifurcation happens, there must 
be (at least) three solutions to 9 = f(8\p). The top of these three has /' < 1, the middle has /' > 1 and 
the bottom has f < 1. Since the average second derivative must be positive in one interval and negative in 
the other, it must have a sign change somewhere. It is not clear if this leads to any simple statements about 
necessary conditions for a hybrid phase transition. 


4 Triangle-based networks 

Many social networks exhibit clustering, and so the status of one neighbor v of u is likely to be correlated 
to the status of another neighbor w. This assumptions of independence of neighbors breaks down. For this 
case, ad hoc approaches are able to make some progress [16] . However, in particular limits analytic results 
are possible. 

In [251 [36] . Newman and Miller independently introduced a model of random clustered networks. In 
these networks, nodes are assigned two degrees, an independent degree kj and a triangle degree k/\ with 
probability P(fc/, h&). a node is then given kj regular stubs and A’a triangle stubs. Regular stubs are joined 
into pairs to define edges, while triangle stubs are joined into triples to form triangles, and so a node will 
have degree Aq + 2 &a- These networks have clustering, but mimic some properties of “tree-like” networks. 
This tree-like property makes them amenable to analytic methods, and m showed that it is possible to 
apply the edge-based compartmental modeling approach to SIR disease in such networks. 

A significant weakness of this network model which must be highlighted is that triangles in these networks 
do not share edges. In fact, locally these are a special case of “cactus graphs” or “Husimi trees” [15], that is, 
each edge is in at most one cycle. So although some clustering is introduced, the structure is limited. This 
can be generalized to more complicated motifs than triangles [2011251155] . but it is still limited. In particular, 
if we look at 3-clique m percolation the result will be fragmented. 

It was shown by [14] that the size of the final active nodes in the WTM can be derived analytically for 
this class of network. Adapting the edge-based compartmental modeling approach leads to a model which 
predicts the dynamics as well. The resulting model is relatively simple, but relies on a large number of 
variables. To refine this to its purest form, we further simplify this network class, assuming that kj = 0 for 
all nodes, so there are no independent edges. All edges appear as part of a triangle. An example is shown in 
figure [5] We use Pa^a,?") to define the joint probability of being part of A; a triangles and having a given 
r. We define ip r ,A{x) = PA(kA,r)x k ^. 

We again take u to be a test node. We consider a triangle involving u, v, and w. Taking £(t), £*(t), and 
£**(A) to be the probabilities that the neighbors in the triangle have combined to transmit 0, 1, or 2 times 
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Figure 5: A portion of a random clustered network. For simplicity we choose networks with no independent 
edges. 


to u we have 


cm =SE 

r>0 m—0 



To interpret this equation, note that the sum over r > 0 considers all thresholds for nodes which are not 
initially active. The sum over to represents the number of triangles that have transmitted to u. We have 
ipl^ (£)/m! is the probability that a node has a given r and has had exactly to triangles transmit. Of those 
to triangles we take d to be the number which have transmitted twice. So given to and d a node has received 
to + d transmissions. So long as d is at most r — m — 1, m + d is at most r — 1. The probability of a given 
to and d occurring is (™)£r _d £**- 
Once we have Q we find 

A = 1 — Q . 


That is if an individual is not counted in the A class it is considered in the Q class. 

To calculate the £ variables, we will need to know <5o, the probability that w is quiescent given that u 
and v are prevented from transmitting to w. and di, the probability that w would still be quiescent even if 
v has transmitted to it (and u has not). These are 



We now introduce some auxiliary variables which help to find the £ variables. We assume u and v are 
prevented from transmitting to w. We define £q to be the probability that w is quiescent, £a the probability 
it is active but has not transmitted to u or v, C,a,u the probability it has transmitted to u but not v, (a,v the 
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Figure 6: The flow diagram for the random clustered networks 



Figure 7: Flow diagram for the £ variables. 
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probability it has transmitted to v but not u, and (a,uv the probability it has transmitted to both. Figure [7] 
demonstrates the flow of these variables. We have 

0 Q = S 0 

Ca,u = PC A PC A,u 
Ca,v = PCa PCa,v 

C A,uv — P(Ca,u T C A,v) 

C A = 1 Cq Ca,u Ca,v Cu,v ■ 

We now introduce our final set of variables. We define 4>ab to be the probability of having one neighbor of 
status A and the other of status B. We use Q* in the subscript of (f) to denote a quiescent neighbor that 
has received transmission from the other neighbor. We use * or ** in the superscript of 4> to denote that u 
has received 1 or 2 transmissions from the neighbors in the triangle. We can find most of these variables in 
terms of the ( variables. The others (Paa, ^aai an d 0aa) can be found in terms of the £ variables, following 
figure [Gj 

0qqW =Sq=Cq 

<t>QA(t) = 2SqCa 
< t>Q*l{t) = 2 S!Ca,v 

pQ A {t ) = 2<5oCa,u 
r Q *At) = 2 s^m, 

(t>AA{t) = C~ &QQ - P'QA - 4>Q*I 
PaaW = C* - <t>QA - Pq* I 

4>aaV) = £** • 

We can now write down our final differential equations for the derivatives of the £ variables 

C = ~P{ ( t ) QA + C>q*i + 2 4>aa) 

C* = P{4>qa + 4>q-i + 2 4>aa ) — P^AA 
£** = P<t>*AA ■ 

We now have a closed system of equations. We could replace the £** equation with £** = 1 — £ — £*. 
Solutions to our equations are shown in figure [8] The generalization of our approach to nonzero ki would 
be straightforward but technical. 

4.1 Discrete time 

We now consider the discrete time version of the spread in these triangle-based networks. The dynamics are 
simplified because we do not need to consider cases where a node has transmitted to some, but not all, of 
its neighbors. Figure [9] shows how the model changes for the discrete case. 

The non-ODE equations remain the same except that Ca.u = CA,v = 0, which implies 4>q*i = 4>qa = 0- 
The ODEs are replaced. We have C Apt) = 1 — 5o(t) — C A,uv(t) and 

C{t + 1) = C(t) - (j>Q A (t) - (f’AA (t) 

C*(t + 1) = C*{t) + <j)QApt) — (j)* AA 

Ca,uv + 1 ) = 1 ~ So(t) . 


Once we have ( A ,uv(t + 1), C{t + 1), and £*(f + 1) we can build up all of the other <p variables at time t + 1. 
Then we can update for the next time. 
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Figure 8: Comparison of simulations (solid) and predictions (dashed) for different sized initial conditions 
in a network with P A ( 1,2) = P A (2,2) = P A (3,2) = 1/3 (left) and P A (1,1) = P A (2,2) = P A (3,3) = 1/3 
(right) in populations of 400000 nodes. Note that the degree of a node is twice its number of triangles fc A . 
In both cases there is a hybrid phase transition occurring for large enough initial condition. In one case, a 
system predicted to be just above the cascade threshold failed to form a cascade due to stochastic chance. 
Close to this threshold, stochastic effects are important and the simulations do not exactly match prediction. 
With larger populations the difference would shrink. 



Figure 9: The flow diagram for the discrete time (synchronous update) random clustered network model. At 
each time step any active node transmits to all of its neighbors. The diagram is somewhat simpler than in 
the continuous time (asynchronous update) case. Only those edges which transit between £, £* and £** are 
labeled as they are needed to determine the values of the £ variables. The other variables can be determined 
in terms of the </, 5, and £ variables. 
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4.2 Threshold 


To understand the threshold, the initial analytic method presented for the configuration model case becomes 
more complicated, so we consider just the second method, calculating the probability that the initial node 
manages to activate others arbitrarily far from it. Our result shows that global cascades are possible from 
arbitrarily small initial conditions (p —> 0). This shows that the conjecture of [12] that global cascades only 
occur if r = 1 nodes percolate does not generalize to networks with clustering. This is not a particular 
surprise: previous researchers have seen this sort of behavior : by activating a single node and all of its 
neighbors (using “cluster seeding”) they can initiate a global cascade [5]. An analytic result of [14] used a 
different approach to derive an equivalent threshold to ours for these networks from which the contradiction 
can also be deduced. 

Consider a triangle containing a single active node u early in the spread and determine the probability 
that the activation results in 1 or 2 further transmissions to others v and w in the triangle. It will transmit to 
both of them directly, but it will only directly cause activation if r = 1. A neighbor has r = 1 with probability 
qi = X]fc A 1)/ (Ka) = V’i.aW/ (Aa)- This results in an additional k(k — l)P(k 1 1)/ Y kP(k , 1) = 

ip" A(1)/a (1) triangles with disease introduced for each r = 1 neighbor. So from the two other nodes in the 
triangle, there are an expected 2qi^" A (l)/^j A (l) = 2i/j" A (l)/ (AT) new triangles with a single introduced 
activation because of the nodes having r = 1. 

However, there is an additional case to consider. If one neighbor has r = 1 and the other has r = 2, 
then the r = 2 node will become active. Early in the spread, an r = 2 neighbor will eventually become 
active from transmissions within the triangle iff the other neighbor has r = 1. Thus the probability that 
one node has r = 2 and becomes active is the probability that one has r = 2 and the other has r = 1. 
That is, 2qiq 2 where q 2 = Yk^ kP(k a , 2)/ (Ka)- The resulting number of triangles from the r = 2 node is 
Yk k A (k A — 1)T(A;a,2)/ (Ka)- So the expected number of new triangles coming through r = 2 nodes is 

2<^" a (1)/(Aa) = 2^ a (1)V-" a (1)/«Aa> 2 . 

So the epidemic threshold occurs when 

{Ka) + (K A f - ’ 

with a population-scale cascade possible if the left hand side is larger. If r values are assigned independently 
of k , this reduces to 

Ui-Ka) 

2P(r = 1)[1 + P(r = 2)] X ' = 1 - 

This condition is consistent with that found by m- 

4.2.1 Condition for hybrid phase transition 

Because of the added variables, it is much more difficult to prove the existence of an “interior” fixed point 
and the saddle-node bifurcation. We will not attempt to prove as strong of a result here. We anticipate that 
the conditions for the hybrid phase transition will be similar. 

We assume that there is no cascade in the p —> 0 limit. We consider a configuration model network for 
which k has the same distribution as &a ■ The active proportion in the configuration model is less than that 
of the corresponding triangle network. This is because being joined to two nodes in a triangle increases the 
probability that at least one transmission has come, and it also opens the possibility of two transmissions. 
If this configuration model network has a hybrid phase transition, then the triangle-based network must as 
well. This is a fairly crude bound. 

4.3 Impact of clustering 

To investigate the role of clustering, we want to compare the spread in triangle-based networks and config¬ 
uration model networks with the same degree distribution. Note that in a triangle-based network, a node 
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with a given fc A has degree 2 fc A , so in the configuration model networks, k has the same distribution as 2k a 
in the triangle-based networks. We find ip r ,CM(x) = Vv, a(x 2 )- 

The condition for a cascade from arbitrarily small p in a configuration model network is that ip'/ CM (1) / (AT) > 
1. In this case, we have a cascade in the Configuration Model network if 

1 < , 0i,cm(1)/ { K ) 

2^ )A (1)+4<, A (1) 

2 (A A ) 

^ !A (1) + 2< iA ( 1 ) 

(Ka) 

If the initially active nodes are randomly chosen, ^ A (l) = P(r = 1) (A' a ) and ip 2 A (1) = P( r = 

1) (AT a — K). So this becomes 


1 < 2 P(r = 


1) 



(K) 


+ P(r = 1). 


If we instead consider the threshold for a corresponding triangle-based network, it is 


1 < 2 P(r = 1) 


A'i - Ka 
(Ka) 


+ 2 P(r = 1 )P(r = 2) 


( k a - K a ) 


(Ka) 


If P(r = 2) = 0, then cascades from arbitrarily small initial conditions are inhibited in the triangle-based 
network compared to the configuration model. However, if 2 P(r = 2) (k'a — Ka) /(Ka) > 1, cascades 
from arbitrarily small initial conditions are enhanced in the triangle-based network in the sense that the 
threshold is reduced. 

We can interpret this result physically by considering an r = 1 node u which receives transmission from a 
neighbor v early in the spread. We assume u and v share a common neighbor w , and we are guaranteed that 
v will transmit to w. In the absence of r = 2 nodes, either the v transmission will infect w or the combined 
u and v transmissions are not enough to infect w. At early time, the u-w edge thus has no impact on the 
spread regardless of w’s threshold. So for initiating a cascade from a small p it is irrelevant. In contrast in 
an unclustered network, if u has the same degree, then it will have one more neighbor it can transmit to, 
which may have r = 1. 


4.3.1 A simple example where clustering inhibits spread 

We can find a simple case where clustering inhibits cascade spread. If we set r u = k u — 1 for every node, 
then no triangle can be invaded. More generally if a network has a fco-clique and every node in this clique 
has r u > k u — ko + 1, then the clique cannot be invaded. To see this, assume u is the first node in the clique 
to activate. It has ko — 1 neighbors in the clique that we are sure are not yet active, so at most k u — ko + 1 
of its neighbors are active. This contradicts the assumption that it activates. 


5 Discussion 

We have investigated the spread of complex contagions through static networks, focusing on the Watts 
Threshold Model (WTM). We have adapted the Edge-Based Compartmental Modeling approach from SIR 
disease modeling to study the dynamic spread of these infectious processes. This leads to a compact system 
of equations for the spread through Configuration Model and triangle-based networks (as introduced by 
Newman [36] and Miller (261 ). This framework helps us to unify many disparate results about this and 
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related models under a common framework, and derive some new results about threshold conditions for 
cascades. 

In Configuration Model networks we find that activation of an arbitrarily small initial proportion can 
lead to activation of a large fraction of the population iff those nodes who require only a single transmission 
to become active ( i.e ., r = 1) form a giant component (confirming a hypothesis of [421 for this case). In 
triangle-based networks r = 2 nodes can contribute to the initiation of cascades from a small initial active 
proportion (showing that the hypothesis is not true in the presence of short cycles). 

For configuration model networks, we have found a sufficient condition for a hybrid phase transition to 
occur when there is a sufficiently large initially active population. If 

1. No node has an activation threshold r u > k u 

2. r = 1 nodes do not form a giant component, and 

3. r = k — 1 nodes do not form a giant component, 

then there is a threshold concentration of initially active nodes which will lead to a cascade. 

5.1 Limitations 

The triangle-based networks we study allow for analytic methods. However, it comes at a price that the 
structure of the networks is restricted. Triangles that share edges are very rare. If we consider fc-clique 
percolation m in other clustered networks and find a large component and r < k — 1 for all nodes in this 
component then if one fc-clique in the component becomes fully active, the process will spread throughout 
the component. The “cluster-seeding” which is often used in simulations where a single node and all of its 
neighbors are activated for the initial condition will lead to this sort of behavior. This process will not be 
captured by our random clustered network model. In essence, some of the behavior of complex contagions 
in more general clustered networks is believed to be a consequence of “wide” bridges. This is missing in our 
clustered networks. 

5.2 Possible extensions 

It is easy to adapt this model to the case where initial active nodes are targeted by degree. This simply 
involves modifying the choice of P(fc,r), noting that r = 0 corresponds to the initially active nodes. This 
appears as a change in Vv, cm(^), but we then cannot assume that ip is proportional to 1 — /?. 

We can also adapt this to account for biased mixing, where the degree of a node provides information 
about the degree of its neighbors. We assume we know P n (k'\k), that is the probability a neighbor has degree 
k' given that a node has degree k. We define 6k to be the probability that a degree fc node’s neighbor has 
not yet transmitted to it. Our function ip would then become a summation of P(k)6^, than a function of a 
single variable 6. Closely related models have been studied by [24] 

We could also adapt this to account for a network which changes in time. Assuming for example that 
existing edges end at some rate r)\. and then the freed stubs seek out new neighbors at another rate 1 J 2 , 
the methods of [321 lead us t° the new model equations. A particularly interesting limit of this would have 
r ]2 —> oo such that nodes immediately find new neighbors when their old edges end. 

It would be straightforward to add a recovered class to this model, much as has been done previously in 
SIR disease models. Some of the relevant calculations have already been performed by [303 • We could also 
consider the possibility of nodes transmitting multiple times. This would involve subdividing 1 — 6 into more 
components based on the number of transmissions that have occurred, much like the £* and £** components 
that occurred in the triangle-based network model. 

We can adapt this approach to consider multiple competing processes spreading as done by 291 T8l 135] . 
May be interesting for political opinion. For understanding the formation of political groups, it would be 
interesting to consider 4 beliefs spreading: A±, A 2 , B\ 1 and B 2 where the two A, are incompatible and the 
two Bi are incompatible, while the A and B processes are independent, with the population rewiring and 
preferentially selecting neighbors with at least one common belief. 
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